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Abstract 

Nonextensive statistics, characterized by a nonextensive parameter q, is a promising and practi- 
cally useful generalization of the Boltzmann statistics to describe power-law behaviors from physical 
and social observations. We here explore the unevenness of the first digit distribution of nonex- 
tensive statistics analytically and numerically. We find that the first-digit distribution follows 
Benford's law and fluctuates slightly in a periodical manner with respect to the logarithm of the 
temperature. The fluctuation decreases when q increases, and the result converges to Benford's law 
exactly as q approaches 2. The relevant regularities between nonextensive statistics and Benford's 
law are also presented and discussed. 
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I. INTRODUCTION 



The foundation of statistical physics was established at the end of 19th century by 



Maxwell, Boltzmann, and Gibbs 



4|. The celebrated Maxwell-Boltzmann-Gibbs statis- 
tics underpins revolutionary concepts such as ergodicity and stosszahlansatz, which means 
that all admissible microstates are equiprobable over a long period of time. Cooperative 
effects and nonlinear dynamics are important in leading to sufficient statistics. The Boltz- 
mann distribution has a distinguished exponential behavior proportional to e~ l3E , where E 
is the energy, and = 1/ksT is the reciprocal of the temperature T with the Boltzmann 
constant ks- 

Recently, it was found that the Boltzmann statistics follows a peculiar digit law 5], 
named Benford's law, which is also called the first-digit law or the significant digit law 0, 7]. 
The law states that the occurrence of the nonzero leftmost digit, i.e., 1,2, ...,9, of numbers 
from many real world sources is not uniformly distributed as one might naively expect, but 
instead, the nature favors smaller ones according to a logarithmic formula, 

P Bcn (d) = log 10 ( 1 + i), d = 1, 2, 9, (1) 



d / 

where PBen(^) is the probability of a number having the first digit d. The first-digit distri- 
bution of the Boltzmann statistics fluctuates slightly around Benford's law in a periodical 
manner with respect to the logarithm of the temperature of the system p]. Moreover, two 
quantum distributions, i.e., the Bose-Einstein statistics and the Fermi-Dirac statistics, are 
also proven to comply with Benford's law in analogous manners 

On the other hand, the natural and social numbers are not always discovered to follow 
the exponential Boltzmann distribution, other than that, many sources are actually found 
to demonstrate power-law behaviors. In 1988, Tsallis proposed an entropy formula with a 



nonextensive parameter q to describe the prevalent power- law behaviors |8Ml l| . In the limit 
of q — > 1, it recovers the familiar Boltzmann statistics. This novel statistics is named the 
Tsallis statistics, or nonextensive statistics. It has several well-defined mathematical rigidi- 
ties and performs impressively well in various practical domains. Empirically, the Tsallis 
statistics is widely employed to explain data from nonextensive viewpoints in various aspects 
in physics, chemistry, economics, computer science, biology, cellular automata, self-organized 



criticality, scale-free networks, linguistics, and other sciences (see Chap. 7 in Ref. lj| and 



references therein for extensively concrete examples). Theoretically, the nonextensiveness 



can be explained in terms of fluctuations of temperature [121 ]. or superstatistics (l3j. The 
physics behind nonextensiveness is believed to be long-range correlations, strongly quan- 
tum entanglements, and deformed phase space due to insufficient statistics in space and/or 



time jSHllI 



For the comprehensive existence of the Tsallis statistics, it appears intriguing to look 
into its digit distribution, besides the studied canonical ones (5)]. The influence from the 
nonextensive parameter q on results may reveal further regularities of natural statistics. We 
study the first-digit distribution of nonextensive statistics in detail both analytically and 
numerically. We find that, analytically, in the range of 1 < q < 2, it slightly fluctuates around 
the first-digit law in a periodical manner with respect to the logarithm of the temperature. 
The deviation from Benford's law is diminutive. As q varies monotonously from 1 to 2, the 
amplitude of fluctuation becomes smaller and smaller, and in the limit of q — y 2, nonextensive 
statistics conforms to Benford's law exactly. Hence, it explains the underlying reason why 
many sources from systems with nonextensiveness respect the first-digit law in an almost 
precise way. 

The paper is organized as follows. In Sec. II, nonextensive statistics is briefly reviewed 
and the normalized probability density is presented. In Sec. IHIt we make the analytical and 
numerical comparisons between the first-digit distribution of nonextensive statistics and the 
significant digit law. Then, several relevant insights are discussed in Sec. IIVt including scale 
invariance, base invariance, and mantissa distribution. Section |V] summarizes the results of 
the paper. 

II. NONEXTENSIVE STATISTICS 

Inspired by the probabilistic description of multifractal geometries, Tsallis postulated a 
possible generalization of entropy js], 

S q = k B qeK, (2) 

q — 1 

where q characterizes the nonextensiveness of the considered system and {pi} are the prob- 
abilities associated with W £ Af microscopic configurations, satisfying Y^LiPi ~ 1- I n 
the limit of q — > 1, S q elegantly recovers the conventional Boltzmann entropy, S q -+i = 
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The entropy in Eq. (jSJ) can be rewritten with the help of q algebra 10|, , 

W WW 

Sg = k B ^2piln q {l/pi) = -k B y]Pila-qPi = -A^y] pi ln 2 - g p», (3) 

i=l i=l i=l 

where the g-logarithmic function is defined as 

x i-<? _ i 

\n q x = - - - , q G K, (4) 

and when q approaches 1, ln ? _>.i x = lnx. 

In contrast to the conventional entropy, S q is non-additive for two independent subsystems 
A and B when q 1, 

S q (A + B) = <^4) + 5^(5) _ 5^4) ^(5) _ ( 
&b fee ks ks ks 

It is clearly seen that the deviation of q from 1 reflects the nonextensiveness of relevant 
systems. 

Through the entropic maximizing procedure, the distribution function f q (E;f3) can be 



obtained 18 



111, 



f q (E; (3) oc ef E = [1 - (1 - q)(3E]—, , (6) 
where the g-exponential function is defined as 

e x q = [1 + (1 - q) x ]T=* , l + (l-q)x>0. (7) 

Note that when q approaches 1, f q (E; (3) returns to the standard Boltzmann distribution 
proportional to e~_^f = e~^ E . 

After normalization to unit, f q (E; (3) is written as 

f q (E;/3) = (3{2-q)-[l-{l-q)pE}^, 1 < q < 2. (8) 

For ranges other than 1 < q < 2, we are not considering in this paper for reasons listed 
below. 

(i) When q < 1, there exists an upper limit for the energy, -E upP er = [(1 — l)^ 1 = 
ksT/il — q), whose physical meaning is not well understood yet. 

(ii) While q > 2, f q (E; (3) cannot be normalized, because the power 1/(1 — q) becomes 
larger than —1. 
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III. THE FIRST-DIGIT DISTRIBUTION OF TSALLIS STATISTICS 



With the knowledge of nonextensive statistics well-prepared from the above section, we 
are going to explore its first-digit distribution P q (d; 0), which now depends on the nonexten- 
sive parameter q, besides the temperature T. Utilizing the language of probability density, 
the likelihood to have the first digit d equals [5|] 

00 Hd+1)-10" 

P q (d;P)= / f q (E;P)dE. (9) 

n=-oc Jd - 10n 

Combining Eqs. (JSJ) and (jHJ), we can get P g (d; (3) straightforward, 

P q {d;f3) = {[l-10 n /3c/(l-g)]^-[l-10 n /3(rf+l)(l-g)]^|. (10) 

n=— 00 

A nice property of the Boltzmann-Gibbs statistics and the Fermi-Dirac statistics, is still 
preserved 

P q (d;10f3) = P q (d;(3), (11) 

which origins from the multiplication appearance of E and f3 in Eq. OH]). As to be explained 
in Sec. IV, Eq. (fTTj) is closely related to the scale invariance of Benford's law. Therefore, we 
can define a new function [5|, 

P*(d;a) = P q (d;f3 = 10 a ), (12) 



which appears to be a one-periodical function, i.e., P*(d; a) = P q (d; a + 1). 

By analogy with the Boltzmann distribution we expand P* (d; a) into Fourier series, 

00 

P;{d-a)= J2 c n e l - 2n ™. (13) 

n=— oo 

We denote the constant component of the Fourier series of P*(d; a), cq, as a new function 
P q (d), and through nontrivial calculations resembling Ref. Js], we can get 

P q (d) = <%= f P;{d ]a )da (14) 
Jo 

/i 00 
{ I 1 - W n+a d{l - q)] & - [1 - 10 n+a {d + 1)(1 - q)] ^ } da 
n=— oo 

= / {[1 - 10 Q d(l -g)]^ 1 - [1 - 10 a (d+ 1)(1 -g)]^ 1 }da 
f°° r 9-2 ,-21 1 

= ^ |[l- U d(l-g)]^_[l_ u ( d+ l)(l_ g )]^j__ dw , 



P q (d=1;)8) 




FIG. 1: (Color online). The probability of the Tsallis statistics with the first digit d = 1, i.e., 
P q (d = l;/3), versus the logarithm of the reciprocal of the temperature /3 = 1/ksT and the 
nonextensive parameter q. 

where a substitution u = 10" is adopted. Now, the differential equation of P q (d) with respect 
to d is rather concise, 

W-sssCstt-s)- < 15 > 

By utilizing the normalization condition, Y^=iPq(d) = 1? we attain 

P q (d) = log 10 f 1 + ~) = P Bc „(rf), (16) 

which turns out to be the first-digit law exactly, in analogy with three canonical statistics jsj. 
Therefore, we conclude that, at a fixed q, the first-digit distribution of nonextensive statistics 
fluctuates around the significant digit law periodically, with respect to a, or equivalently, 
the logarithm of (3. Furthermore, we stress that, the central value, i.e., Cq, is independent of 
the nonextensive parameter q. In contrast, as we will see later, the strength of fluctuation 
depends on q. 

Numerically, the probability of owning the first digit d = 1, i.e., the function P q (d = 1; f3), 
is illustrated in Fig. [TJ The cross section of the figure at q — 1 is the very case for the 
Boltzmann statistics {J. The maximum amplitude to deviate from Benford's law is less 
than 0.03 when q = 1 [5]. For g / 1, from the figure, we see that the deviation is even 



P q (d=5 ;j e) P q (d=9;/0 




FIG. 2: (Color online). The probabilities of the Tsallis statistics with first digit d = 5 (left) 
and d = 9 (right), versus the logarithm of the reciprocal of the temperature j3 = 1/ksT and the 
nonextensive parameter q. 

smaller and decreases monotonously with increasing q. In the limit of q — > 2, we prove 
in Sec. IV analytically that the digit distribution of nonextensive statistics complies with 
Benford's law exactly. 

The probabilities for digits other than digit 1 behave similarly. In Fig. [2j we depict 
the probabilities of possessing the first digit as d = 5 and d = 9 as examples, versus the 
nonextensive parameter q and the logarithm of j3. The central values follow exactly Benford's 
law. The deviations from Benford's law for digits 2-9 are smaller than that for digit 1. Here 
for 5 and 9, their deviations are both less than 0.01 for q — 1 [5], and further smaller for 
q^l. 



IV. DISCUSSIONS 



Benford's law applies to numerous data from natural sources, e.g ., areas of lakes, lengths 



sars 



151 ] , hadron full widths [16| , 



of rivers [7j , physical constants [14| , various quantities of pu 
complex atomic spectrum 171 ]. and a-decay half lives |18 
statistics performs successfully at describing ubiquitous power-law behaviors [ll|, and pos- 



19J . Meanwhile, nonextensive 



sesses elegantly conceptual merits as well 



4ll|. Hence, the regularities between Benford's 



law and nonextensive statistics, especially, the influence from the nonextensive parameter q, 
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appear as an amusing and significative issue. 

Generally, q is explained as a parameter reflecting microscopic mechanism of deforming 
phase space or fluctuating the temperature of the system, however unfortunately, due to the 
imperfect understanding of the dynamical details of generating nonextensiveness, it cannot 
be determined as a prior nowadays. A practical way to obtain q is through numerical 
fittings to Eq. (jSJ). Our results might point to another possible way to access q by counting 
the occurrence of the first digits and investigating their deviations from Benford's law, which 
costs less time than conventional fitting procedure. 

Now in the following, let us discuss three properties of the first digit law related to the 
studies presented in this paper. 



First, Benford's law is scale invariant, which is discovered by Pinkham in 1961 [20j. This 
property means that the law remains unchanged under numerical rescalings, hence does not 
depend on any particular choice of units. As mentioned, in our study, it is closely related 
to Eq. ( fTTj) . which is ascribed to the multiplication appearance of and E. The rescaling 
of j3 is equivalent to inversely rescaling E. Therefore, concerning the scale invariance of 
Benford's law, we expect that Eq. (1TTI) emerges naturally. Worthy to note that, Benford's 



law is also 
base b [2]], 



Dase invariant, which means that the regularity of the law is independent of the 
22| | . In the binary system (b = 2), octal system (6 = 8), or other base system, 



the data, as well as in the decimal system (b = 10), all fit the general Benford's law, 

P Ben (d) = log, ( 1 + 3V d = 1, 2, b - 1. (17) 



Second, the independence of P q (d) on q in Eq. f|T6|) breaks out as an astonishing result 
at first sight. However, it was proven by Smith by utilizing the language of digital signal 
processing in his textbook that, in reference to the logarithmic coordinates, the digit distri- 
bution of any probability distribution function should fluctuate around Benford's law with 
respect to the rescaling parameter 23(|. Therefore, the q- independence is expected. As for 



our study, as stated above, the dependence on (3 is equivalent to the rescaling of E. There- 
fore, the oscillation of the digit distribution around Benford's law is consistent with Smith's 
analysis from the viewpoint of rescaling. 

The last point we would like to present is the proof of the coincidence of the first-digit 
distribution of nonextensive statistics and the significant digit law in the q — > 2 limit. It can 
be ascribed to the "1/m" behavior of mantissa distribution js|. The mantissa m G [0.1, 1) 
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is the significant part of a floating-point positive number x, denned uniquely as x = m x 10 n , 
where n is an integer. It was pointed out that, if the probability density of m is distributed 
according to 1/m, then Benford's law is guaranteed Ql. Actually, the "1/m" distribution 
of mantissa is equivalent to the n-digit Benford's law [5j. For nonextensive statistics, when 
q approaches 2, f g ^ 2 (E; (3) oc 1/(1 + (3E). The integral divergence occurs when E — > +oo, 
where the behavior of f q (E; 0) is proportional to 1/E. Thus, all mantissa contribution comes 
from the E — > +oo region with probability density proportional to 1/m. Consequently, 
nonextensive statistics follows Benford's law exactly in the limit of q — > 2 accordingly. 

V. SUMMARY 

The Maxwell-Boltzmann-Gibbs statistics is one of the most celebrated achievements in 
the history of physics, and it has many implications and applications in various physical as 
well as social domains. However, there also exist numerous examples characterized by power- 
law behaviors other than the canonical exponential distribution. The power-law behavior is 
elegantly and efficiently described by nonextensive statistics with a nonextensive parameter 
q, proposed by Tsallis in 1988. 

While the Boltzmann statistics was proven to follow Benford's law Q], which states the 
uneven occurrence of the first nonzero digit, then it becomes intriguing to look into the digit 
distribution of nonextensive statistics and the dependence on the nonextensive parameter 
q. We find analytically that the first digit distribution of nonextensive statistics has similar 
behaviors as that of the Boltzmann distribution. It fluctuates slightly and periodically 
around Benford's law with respect to the logarithm of the temperature. With increasing 
nonextensiveness in the range of 1 < q < 2, the fluctuation decreases monotonously. In 
the limit of q — > 2, nonextensive statistics follows Benford's law exactly. Furthermore, 
the fluctuations of temperature in physical systems can smooth down the oscillation to the 
central value, which corresponds exactly to the first-digit law. Therefore, we reveal that, the 
frequent appearance of Benford's law in the natural and social data is theoretically expected 
in systems that follow nonextensive statistics. 
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